Standard Deviation as a Query Hardness Estimator

نویسندگان

  • Joaquín Pérez-Iglesias
  • Lourdes Araujo
چکیده

In this paper a new Query Performance Prediction method is introduced. This method is based on the hypothesis that different score distributions appear for ‘hard’ and ‘easy’ queries. Following we propose a set of measures which try to capture the differences between both types of distributions, focusing on the dispersion degree among the scores. We have applied some variants of the classic standard deviation and have studied methods to find out the most suitable size of the ranking list for these measures. Finally, we present the results obtained performing the experiments on two different data-sets.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Wavelet Threshold Estimator of Semiparametric Regression Function with Correlated Errors

Wavelet analysis is one of the useful techniques in mathematics which is used much in statistics science recently. In this paper, in addition to introduce the wavelet transformation, the wavelet threshold estimation of semiparametric regression model with correlated errors with having Gaussian distribution is determined and the convergence ratio of estimator computed. To evaluate the wavelet th...

متن کامل

Algorithms based on sparsity hypotheses for robust estimation of the noise standard deviation in presence of signals with unknown distributions and occurrences

Inmany applications, d-dimensional observations result from the randompresence or absence of random signals in independent and additive white Gaussian noise. An estimate of the noise standard deviation can then be very useful to detect or to estimate these signals, especially when standard likelihood theory cannot apply because of too little prior knowledge about the signal probability distribu...

متن کامل

A Study on the Class of Chain Ratio–type Estimators

 This paper considers the problem of estimating the population mean Ybar of the study variate Y using information on different parameters such as population mean $(bar{X})$, coefficient of variation $(C_x)$, kurtosis  $beta_{2(x)}$, standard deviation $(S_x)$ of the auxiliary variate x and on the correlation coefficient, $rho$, between the study variate $Y$ and the auxiliary variate $...

متن کامل

A Two-Phase Robust Estimation of Process Dispersion Using M-estimator

Parameter estimation is the first step in constructing any control chart. Most estimators of mean and dispersion are sensitive to the presence of outliers. The data may be contaminated by outliers either locally or globally. The exciting robust estimators deal only with global contamination. In this paper a robust estimator for dispersion is proposed to reduce the effect of local contamination ...

متن کامل

Algorithms and Applications for Estimating the Standard Deviation of AWGN when Observations are not Signal-Free

— Consider observations where random signals are randomly present or absent in independent and additive white Gaussian noise (AWGN). By using a recently established limit theorem, we introduce a new estimator for the estimation of the noise standard deviation when the signals are less present than absent and have unknown probability distributions. The bias, the consistency and the minimum attai...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2010